The use of F0 reliability function for prosodic command analysis on F0 contour generation model

نویسندگان

  • Mitsuru Nakai
  • Hiroshi Shimodaira
چکیده

This paper describes a method of utilizing an “F0 Reliability Field” (FRF), which we have proposed in our previous work, for estimating prosodic commands on F0 contour generation model. This FRF is the time-frequency representation of F0 likelihood, and an advantage of FRF is that it is not necessary to consider F0 errors that occur during an automatic F0 determination. Therefore, it is thought that FRF can be a more useful feature for automatic prosody analyses than F0 contour, and our previous paper has reported the validity of FRF on the analysis of detecting prosodic boundaries in Japanese continuous speech. Moreover, in this paper, we have examined the validity on the prosodic command estimation of superpositional model. Experimental results show that the accuracy of command estimation with FRF is well and it is close to the accuracy of command estimation with ideal F0 contour that has no F0 error.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Corpus-based generation of prosodic features from text based on generation process model

A total scheme of generating prosodic features from a text input was constructed. The method consists of corpus-based prediction of pauses, phone durations and fundamental frequencies (F0's), in this order, and information predicted in an earlier process is utilized in the following processes. Since prediction of F0's is done on the command values of F0 contour generation process model instead ...

متن کامل

Corpus-based Generation of F0 contours of Japanese based on the Generation Process Model and its Control for Prosodic Focus

A total corpus-based process of generating prosodic features form text is developed. The process first predicts pauses and phone durations, and then generates F0 contours. Since F0 contour generation is based on the generation process model, it is rather easy to manipulate the generated F0 contours in command level. A method was developed for generating sentence F0 contours, when a focus is pla...

متن کامل

An F0 Contour Model in Chinese Based on Templates of Prosodic Words

The problem of F0 contour generation in Chinese are addressed in this paper. An F0 contour model based on templates of prosodic words is proposed. Taking templates of prosodic word F0 contour as the basic units, the basic structure of the model is established with references to the “small ripples on top of big waves theory” and “Fujisaki model”. A three-layer prosodic hierarchy which consists o...

متن کامل

Corpus-based synthesis of fundamental frequency contours based on a generation process model

A mode-constrained corpus-based synthesis strategy was developed for fundamental frequency (F0) contours of Japanese sentences. In the training phase, the relationship between linguistic factors and the command values (amplitudes and locations) of F0 contour generation process model was learned for a prediction module; a neural network in the current paper. Input parameters consist of linguisti...

متن کامل

Restoration of pitch pattern of speech based on a pitch generation model

In this paper a model-based approach for restoring a continuous fundamental frequency (F0) contour from the noisy output of an F0 extractor is investigated. In contrast to the conventional pitch trackers based on numerical curve-fitting, the proposed method employs a quantitative pitch generation model, which is often used for synthesizing F0 contour from prosodic event commands for estimating ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998